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Abstract 

For a large class of statistical systems a geometric mean value of the ob- 
servables is constrained. These observables are characterized by a power-law 
statistical distribution. 



In everyday life we find events of very different nature often to follow 
similar statistical distributions. These distributions arise because the same 
stochastic process is at work, and this process can be understood beyond 
the context of each example. Who is responsible for the particular forms of 
these distributions? This is a fundamental problem. The least bias approach 
to this problem was promoted by E.T.Jaynes about half century ago as a 
Maximum Entropy Principle (MEP) [1 . The MEP states that the phys- 
ical observable has a distribution, consistent with given constraints which 
maximize the entropy. It is important that the a priori given constraints 
are defined by a macroscopic mechanism the observable is constructed. If 
the observable values are restricted to be within a limited interval only, the 
resulting statistical distribution is a uniform distribution. A maximization 
of Shannon-Gibbs entropy with a constraint on the observable's arithmetic 
mean value results in an exponential distribution. Examples of the observ- 
ables constructed such a way that an arithmetic mean values are constrained 
are following. 
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a) Time until the next event, given N events are uniformly distributed over 
a time period T. In this case the arithmetic mean value is constrained 
to be T/N. 

h) energy of an ideal gas molecule has the arithmetic mean value con- 
strained to be E/M, provided the isolated gas volume contains M 
molecules with total energy E. 

c) Time until a radioactive particle decays, with the arithmetic mean value 
given by the nature of the decaying particle. 

It is well known, the exponential distributions are observed for a)-c). Note, 
that the given examples haven't any other constraints than the arithmetic 
mean. An additional constraint would distort the exponential distribution 
form. Consider a set of unbiased measurements of a rod length by a ruler. 
The arithmetic mean value of these measurements is defined by true physical 
length of the rod and, in addition, the variance is defined by accuracy of the 
ruler. A maximization of Shannon-Gibbs entropy with constraints on the 
arithmetic mean and the variance results in the normal distribution, as it 
is usually observed for a set of repetitive measurements. Ubiquity of expo- 
nential distributions in the Nature implies that a vast class of processes has 
only one intrinsic constraint which is the arithmetic mean of a correspondent 
observable. 

However, the exponential distribution is not only ubiquitous one in the 
Nature. Power-law distributions occur in an extraordinary diverse range of 
phenomena in physics, biology, geography, economics, lexicography, comput- 
ing, ecology, etc. The examples of observables having a power-law behavior 
are: a settlement population, people's annual incomes, the sizes of earth- 
quakes, avalanches and landslides, the forest fires and solar flares, computer 
files, trades on a stock market, the occurrence of words in a book, the sales 
of commodities, a number of sexual contacts, particle energy in a geomag- 
netic plasma, hadrons produced in high energy particle interaction, etc 
In order to obtain a power-law distribution from MEP one has to constrain 
an average logarithm of the observable value, or, equivalently, a geometric 
mean value. This seems to be odd since the use of geometric means is not 
generally anticipated. That is why it becomes increasingly attractive to de- 
rive the power-law distributions by exploiting MEP for non-extensive form of 
entropy with a traditional constraint on an arithmetic mean value [3]. We ar- 
gue below that for a vast class of phenomena having a power-law statistics an 



2 



arithmetic mean isn't defined from tlie first principles, while an application 
of a geometric mean constraint is rather logic. This has been already pointed 
out in 19th century by Francis Galton who has stressed an importance of ge- 
ometric means for vital and social statistics |3]. Let us consider below some 
selected examples in which geometric mean values get constrained. 

I. Population of settlements. Since 1913 it was observed that a distri- 
bution of the number of cities as function of it's human population follows 
a power-law [Sj. A growth (decay) of cities is a complicated process which 
generally depends on political and economical situation, human migrations, 
birth (death) rate, etc. However, on a macroscopic level these factors are 
defined in terms of a relative rate of increasing (decreasing) population, i.e. 
a ratio of a city population change (Ax) over certain time period (At) to the 
city population (x). An average of this ratio could, in principle, be modeled 
a priori independently of a city size. On the other hand the absolute value 
Ax in not constrained and strongly depends on the city size x. 

II. Frequency of occurrence of words in a book. Suppose that the words 
in a book are ranked according to their number of occurrences, with the 1st 
rank corresponding to the most frequent word. In early 1930s, G.K.Zipf has 
noticed that for large ranks the number of word occurrences has a power-law 
dependence as function of the word's rank This remarkable observation 
holds true for different languages, authors and styles and is often termed 
as Zipf's law. A quantitative explanation of Zipf's law was later provided 
by H. A. Simon as a text generation model [7]. According to this model a 
probability to add a new or an already used word is a context dependent. 
The words that have been already used will be added to the text with a 
frequency (An) per text unit, say line, (A/) proportional to the number of 
its previous occurrences. Thus, the model of the text generation defines a 
characteristic ratio of the word occurrence increment An/ A/ over the prior 
occurrence n. 

III. People's annual income. The distribution of people's income in all 
countries and cultures has a power-law form. This phenomenon is called 
Pareto law after the Italian economist Vilfredo Pareto, who has first reported 
it in the end of XIX century jH]. A toy model of wealth exchange within a 
society suggests that one invests each time (to gain or to loose) in average a 
certain fraction of one's income. After some iterations the number of poor 
people will rise up while a small fraction of rich people will build up a long tail 
of the distribution towards enormous richness. The poor get poorer and the 
rich get richer, but in case people are bind to invest in average a predefined 
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fraction of their personal income the final distribution of people's income has 
a power-law form. Contrary to that, if people exchange a random amount of 
wealth, which is only limited by the size of income the final distribution of 
people's income is exponential 0. 

IV. Sexual contacts. The investigations of the nature of the web of human 
sexual contacts aimed at preventing the spread of sexually-transmitted dis- 
eases was undertaken in Sweden PU]- The survey involving a random sample 
of 4781 Swedish individuals demonstrated that the distributions of the num- 
ber of sexual partners during the twelve months prior to the survey decays 
as a power law with similar exponents for females and males. The models for 
building up the web of sexual contacts deal with an average frequency to take 
a decision to change the sexual partner. This frequency is different for differ- 
ent individuals and depends on the personal experience. The more partners 
(n) one has changed in the past, the higher probability (dn/dt) to find a new 
partner in the future in average. Since a characteristic ratio < dn/dt/n > 
could vary for different social groups, the reported observation was restricted 
to a certain age range with prostitutes excluded. 

The above examples (I-IV) have a common feature: the evolution of 
the physical observables is a statistical process which is determined by a 
probability (P) as function of a relative change of the observable per an 
evolution step. This distribution has a characteristic scale of the evolution 
defined by 




Therefore, for a statistical ensemble of observable values {xi) 
1 ^ 1 dx- 1 ^ d 

- y = - y -log(xi) = a. (2) 

After adding up many subsequent evolution steps one finds 

1 ^ 

-Y.log{x,:)=A + a-T, (3) 
i=i 

and 

N i_ 

(n ^) = ('MA + a-T). (4) 

i=l 

Here, for simplicity, the parameter a is assumed to vary slowly with time. 
In fact, if at the end of the evolution the final probability distribution be- 
comes stable, the parameter a turns to zero as it is expected for a system 
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at equilibrium. Thus, for the discussed class of the statistical systems a ge- 
ometric mean is constrained by the system's evolution prescriptions, whilst 
an arithmetic mean or, equivalently, a characteristic scale of < dx/dt >, has 
a little sense and could be only deduced from the measurements. From this 
point of view, the attempts to derive the power-law statistics for I-IV by 
exploiting MEP for non-extensive form of entropy with a constraint on an 
arithmetic mean value would look unjustified. Note, if the final distribution 
of the observable x follows exactly a power-law, then the geometric mean 
is the only one constraint which is available. If so, the MEP suggests the 
probability distribution P in (Q) to be exponential. Indeed, it is likely, for 
the words with a given frequency of occurrence in a book a text length until 
the word occurs next time has an exponential distribution. Similarly, for a 
particular category of individuals who change a given number of sexual part- 
ners per year a waiting time until the next partner is also likely to have an 
exponential distribution. Thus, for the discussed above examples (I-IV) one 
could generate the dx/dt distribution using an exponential distribution (e) 

^ = x-e. (5) 

dt ^ ' 

Contrary to that, for a class of observables having an exponential distribution 

with a constant model parameter h. In this case the evolution of the system 
is determined by an absolute change of the observable per an evolution step. 
This evolution mode is realized, e.g.^ in the radioactive atom decays, where 
each time the decay probability is independent of how long the atom survived 
before. There is no good reason to forbid a mixed evolution model with 

— = x ■ e + ■ e . (7j 

For such mixed model the arithmetic mean value is 

, 1 dx 



and, therefore, a corresponding constraint on x is defined as 

(/o^(6 + a;)) = A + a-T. (9) 
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A maximization of Shannon-Gibbs entropy with the constraint results in 
a damped power-law probability distribution 



For small values of x the damped power-law function ()10|) is approximated 
by exponential distribution, while for large values of x it has a power-law 
behavior. Contrary to the power-law distribution the damped power-law 
function is not divergent at low x values. This mixed behavior seems to 
be favorable in the nature. Many observables which initially are found to 
exhibit a power-law behavior later, after the accurate measurements at small 
X values have been performed, turned out to follow very closely a damped 
power-law distribution. So, in jTI] the data on the word frequency occurrence 
are fitted to the damped power-law function. This implies a correction to the 
mentioned above toy model of text generation. It is plausible that the most 
frequent "service" words (like "a", "the", etc) have to be added to the text 
with a constant probability independent of the context of the book. Among 
other numerous examples of the observables which are amazingly well follow 
a damped power-law distribution are trade sizes ^2] and inter-trade time 
intervals ^31 on stock markets, hadron production in high energy particle 
collisions and particle energy in a geomagnetic plasma ^3]. For the 
latter the damped power-law distribution is termed a "kappa-distribution". 

Finally, there are events of vital importance found to follow a power-law 
or damped power-law statistics. A mechanism of the most these phenom- 
ena is still poorly understood. Those phenomena are the trades at stock 
markets O [13] , global terrorism , extinction of biological species [T7] , 
etc. For these and many other events with empirically known statistical dis- 
tributions the MEP provides important information about the constraints 
at macroscopic level and, therefore, is helpful to understand the intrinsic 
mechanism of the phenomena. 

The author would like to thank A.Kropivnitskaya and T.Tokareva for 
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